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- - ' Abstract. We show that any bounded zero- angular momentum solution for the Newtonian three-body 

^ , problem must suffer infinitely many eclipses, or collinearities, provided that it does not suffer a triple collision. 

' Motivation for the result comes from the dream of building a symbolic dynamics for the three-body problem, 

CNJ , one whose symbols 1, 2, 3 representing the three types of eclipses. The proof involves the conformal geometry 

-4-^ ' of the shape sphere. 

■ 1. Infinitely Many Eclipses. 

I A solution to the Newtonian three-body problem suffers an eclipse when the three bodies, taken to be 

CS| ■ point masses, become collinear. The solution is bounded if the distances between bodies remains bounded 

' by a fixed constant for all time. 

Theorem 1. Every bounded solution of the three body problem with zero angular momentum and no triple 

' collisions suffers inhnitely many eclipses. 

1^ I Mark Levi conjectured this theorem during a conversation with the author in 1998. 

■ The Lagrange solutions show that the theorem fails if we omit the zero angular momentum condition. 
^ I In these solutions the three bodies form an equilateral triangle at every instant. Bounded Lagrange solutions 

I— with non-zero angular momentum exist for all time, and for all mass distributions. They suffer no eclipses, 

^ I nor triple collisions. 

^ ■ The theorem allows binary collisions in which case we use Levi-Civita regularization to analytically 

I continue the solution through the binary collision, which counts as an eclipse. The only obstruction to 

i infinite time existence for a three-body solution is triple collision. As long as the solution suffers no triple 

g : coll..,„„, it can be c„„..„„cd a.al,t..all, in (.-egulan.ed, ti„e. 

^ . 2. Motivation. 

I Eclipses come in three types, labelled 1, 2, and 3 depending on the mass which lies between the other two. 

■""..^ ■ An eclipse sequence is an infinite sequence in the letters 1, 2, and 3. We may associate to each collision-free 

I solution its eclipse sequence. If the solution is periodic modulo rotations then its eclipse sequence is periodic. 

■ The free homotopy type of a curve which is periodic modulo rotation, whether a solution or not, is encoded 
^ I by its periodic eclipse sequence. Is every free homotopy realized by a collision-free periodic-modulo-rotation 
• • I solution? In other words, does every periodic eclipse sequence arise as the eclipse sequence of some such 

. 5^ I solution? Wu-Yi Hsiang asked me this question in 1996. It helped lead to the rediscovery of the figure eight 

. solution ( Chenciner and Montgomery [2000]), a solution with eclipse sequence 123123. More generally, we 

i-j I can ask is every infinite eclipse sequence realized by a solution? When we attempt to realize a given eclipse 



sequence by the direct method of the calculus of variations, the solutions we obtain (if any) are forced to 
have zero angular momentum. See Montgomery [1998]. This leads us to ask the following closely related 
questions. Is the set of collinear states a kind of a slice for the zero-angular momentum three-body dynamics? 
If so, does this slice lead to a symbolic dynamics in the symbols 1, 2, and 3? Theorem 1 is a partial answer to 
the slice question since it asserts that every zero angular momentum bounded orbit without triple collision 
must intersect the alleged collinear "slice" an infinite number of times. 

3. Intuition and Shape Space 

Shape space is the space of oriented congruence classes of triangles in the plane. It is homeomorphic to 
M^, but is not isometric to it. (See section 11.) We will use spherical coordinates {R, cj), 9) on shape space. 
R measures the overall size of the triangle, and is related to the triangle's moment of inertia / (formula in 
next section) by R^ = I. The variables (0, 9) coordinatize a two-sphere which we call the shape sphere and 
whose points represent oriented similarity classes of triangles. Any motion of the three bodies projects to 
the motion of a single point in this shape space. When that motion is a zero angular momentum solution to 
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Newton's equation then this shape; spac;e motion is defined by a seeond-order differential equation in shape 
space which itself has the form of a Newton's equations, but now in shape space. Under the homeomorphism 
of shape space with Euclidean three-space, the set of collinear triangles is represented by the xy plane. 
The origin of shape space represents triple coUison. Within the collinear plane, and issuing forth from the 
origin, lie three rays whose points represent the binary collision configurations. The zero angular momentum 
Newton's equation written on shape space says that the three binary collision rays exert an attractive force 
on the moving point. Since the rays lie in the collinear plane, this force is always directed towards this plane. 
Levi conjectured, arguing from mechanical intuition, that the point is obliged to either oscillate up and down 
across the collinear plane or escape to infinity. 

4. An oscillatory area. 

The proof of theorem 1 is based on a differential equation for a certain normalized signed area z of the 
triangle formed by the three bodies, and described by theorem 2 below. The signed area A of the triangle 
whose vertices are xi , X2 , X3 is 

A = -n • (x2 - xi) X (x3 - xi) 
where n is the normal to the plane of the triangle. Define a normalized signed area by 

_ 4 A 

where ^ 

-^1 = 3 (''12 + ^23 + ^31) with r„- = \x^-Xj\ 

would be the moment of inertia of the triangle with respect to its center of mass provided the masses m, 
of its vertices were all 1. h is to be compared with the triangle's true moment of inertia 

I = Im = (mim2ri2 + m2m3r23 + msmirl-^) / {mi +m2+ ms). 

The subscript m ~ (nzi, 7712, m^) indicates the mass distribution of the three bodies. There are constants c, C 
such that cli < I < CI\. The motion is bounded if and only if there is a constant C, such that I{t) < C* 
for all time t. The motion has a triple collision at time t if and only if I{t) = 0. 

The variable z lies between —1 and 1, with z = ±1 if and only if the triangle is Lagrange, i.e. equilateral. 
It will be related to the spherical coordinate 4> mentioned briefly in the preceding section hy z = sin(0). The 
solution suffers an eclipse at time t if and only if z{t) = 0. Thus theorem 1 asserts that z{t) has infinitely 
many zeros. 

The zero-angular momentum Lagrange solutions, or Lagrange homothety solutions plays a central role 
in our work here. In these solutions an equilateral triangle shrinks by homothety to a point in finite time, 
thus ending in triple collision. 

Theorem 2. The normalized area variable z satisfies the differential equation 

= (1) 

along any zero-angular momentum solution to the three body problem. The functions f and q are smooth 
nonnegative functions, with f a strictly positive of shape alone, while q is a function of shape and velocities 
which is positive except along initial conditions for the Lagrange homothety solution where it is zero. 

Explicit formulae for the functions / and q of theorem 2 are 

/ = 3mim2m3/i /(mi + m2 + 7713)/ = IX (2) 

and 

1 cos(0) 1 9A, ^, -9 9/,xA9x cos(d)) dU 
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with 



A = 3mi'm2m.sli / (nil + 1112 + ms)/^, 



with ^, 9 certain spherical coordinates on the shape sphere described in section 9, </> being related to z by 



and U = U{I, cp, 6) being the negative of the usual Newtonian potential, viewed as a function on shape space. 
The difficult part of the proof is establishing the positivity of q. 

Corollary to the proof of theorem 1 . The normalized height function z{t) of a zero angular momentum 
solution, bounded or not, has exactly one criticial point between any two successive zeros, i.e. successive 
eclipses, and this is a nondegenerate critical point. In particular, if the zeros occur at ti and t2 with ti < t2 
and if tc is the critical point, then z{t) is strictly monotonic on the subintervals ti <t <tc and tc <t <t2. 

5. Proof of Theorem 1. 

We prove theorem 1, assuming theorem 2. An eclipse is a zero of z, so wc must show that z has infinitely 
many zeros. Equivalently, we show that on any infinite interval a <t < +00 there is a zero of z. 

Restrict attention to the case z{t) > 0. The argument for z{t) < proceeds in an identical manner 
except that the signs of z and its derivative z are to be reversed. Wc first show that if z{ti) > and < 
then at some later time t2 > ti we must have z{t2) = 0. Next we will show that if z{t) > then eventually 
for some later time t^ > t we must have z{t^,) < 0. Together, these facts show that z{t) has a zero some 
finite time later, and complete the proof. 

So suppose that that z{ti) > and z{ti) < 0. Write i = j{fz) and integrate over the interval ti < s < t 
to obtain 



that /(s)i(s) is monotone decreasing over any time interval on which z is positive. That is. /(.s)i(s) < 
f{ti)z{ti) := —6 < for s > ti, as long as z{s) is positive. The boundedness of our solution and hence of I, 
the fact that A is a continuous positive function on the sphere, and the fact that f = IX (see eq. (2)) together 
imply that / is bounded. So there is a positive constant K such that < f{t) < K along our solution. Then 
1// > 1/K and —1// < —1/K. Consequently z = {fz)/f < —S/K over our interval of positivity of z. Now 
suppose that z{t) remains positive over the interval ti < s < t2. It follows from our integral equation for 
z{t) and the inequality immediately above that 



This inequality together with z{t) < 1 forces ^(^2) to be negative as soon as <2 — ii > K/6. Consequently z 
must have a zero within the time K/S. 

It remains to show that there must be a time at which i is negative. This is equivalent to showing 
that it is impossible for a collision-free bounded zero-angular momentum solution to simultaneously satisfy 
z{t) > and z > over an infinite time interval a < t < 00. We argue by contradiction. Suppose we have 
such a solution. Since i > for all t> a. the function z is positive and monotone increasing over the whole 
infinite interval, and so tends to its supremum in infinite positive time. But z is bounded by 1, so that we 
must have i ^ 0. Again f = XI is bounded. It follows that the limit of /i as t ^ 00 must be zero. We 
now show that the limit of limt_>(x) z{t) = 1, which is to say, that the limiting shape is Lagrange's equilateral 
triangle. For suppose not. Then z is everywhere positive and bounded away from Lagrange. Recall that 
the coefficient function q of the differential equation (1) is non-negative and contimioiis, and is zero if and 
only if the shape is Lagrange and the initial conditions are those of Lagrange homothety solution. It follows 
that if limtz(i) < 1 then q > c everywhere along our solution, for some positive constant c. Now use the 
differential equation (1): ^{fz) = —qz. Since q > c > and z > z{a) > the right hand side of this 



z = sin(0) 





z{t2)<z{ti)-{6/K){t2-ti). 



3 



differential equation is strictly negative and bounded away from zero by the negative constant —cz{a). This 
contradicts limj^oo fz = 0. 

Now we know that 2; — > 1 monotonically as t — > c» while fz decreases monotonically to zero. The first 
fact says the configuration approaches the Lagrange equilateral shape. We will now show that there arc 
times tj tending to infinity for which the corresponding velocities approach those of the Lagrange homothcty 
solution. Integrating the differential equation (1) of theorem 2 from t = atooo and using limj^oo f{i)z{t) — 
we obtain q{s)z[s)ds = — /(ti)i(ti). It follows that q{s)ds is finite. This implies that the liminf of q 
as t — + 00 is 0. Thus there are time intervals tj — + 00 over which q[s) is as small as we please. (We 

have not excluded the possibility that limt^oo sup q{t) > 0.) During these intervals of small q the solution is 
nearly tangent to the Lagrange homothety configuration, since this is the only place in phase space where q 
is zero. In other words, the w-limit set of our solution curve contains points of phase space which are initial 
conditions for the Lagrange homothety solution. 

It follows that our solution contains arcs which follow the Lagrange homothcty solution arbitrarily 
closely, and hence come arbitrarily close to the Lagrange triple collision. We now use the results of Moeckel 
[1983] on the linearization of the flow near Lagrange triple collision. He performs a McGehee-type blow- 
up to add the triple collision states as a boundary to phase space. The Lagrange triple collision point 
becomes a hyperbolic rest point of the resulting vector field, and the Lagrange homothety solution lies in 
its stable manifold. We have seen that our solution curve comes arbitrarily close to the saddle point, but 
does not lie on its stable manifold, since if it did it would suffer a triple collision. It follows that the solution 
curve has near-collision hyperbolic shaped arcs in which it closely follows the stable manifold of the saddle 
point, coming very close to the point, then makes a sharp turn and follows the unstable manifold to exit 
a small neighborhood of the point. Consequently its distance in phase space from the saddle point must 
decrease. We will now show that the distance in configuration space from the Lagrange point must also 
increase. Indeed, near triple collision the unstable manifold of the Lagrange point is transverse to the fibers 
of the projection [con figuration, velocity) {configuration). This transvcrsality follows from the same 
transversality for the negative eigenspace of the linearized fiow at Lagrange point. See Moeckel [1983], pp. 
228-229. Consequently, the spherical distance of our solution from the Lagrange point must increase. This 
distance can be measured by 1 — z. Thus z must decrease hence we must have i < somewhere, as desired. 

QED 

6. Proof of the Corollary. Consider again the case z > Q. We saw in the proof of theorem 1 that 
once i < then z continues to decrease monotonically until it crosses zero. Thus it can have only one 
local maximum, on one side of which it is monotone increasing and the other side of which it is monotone 
decreasing. At this maximum we have i = 0. At such a critical point of z eq. (1) of theorem 2 reads 
fz = —qz. It follows that ^ < at this maximum, since / and q are positive. QED. 

7. Reduced dynamics. 

The proof of theorem 2 boils down to computing Newton's equations of motion for the three bodies using 
good coordinates on shape space. Newton's equations are the Euler-Lagrange equations for the Lagrangian 

l=Ik+u 

where K = mi||a;i||^ -|- m2||a;2||^ + ?7i3||a;3||^ is twice the kinetic energy, and U = m\m2/r\2 + Tnim2/ri3 + 
m2ms/r23 is the negative of the potential energy. Here Xi, i = 1,2,3 denote the positions of the three bodies, 
Xi are their velocities, and = ||a;i — Xj\\ is the distance between body i and body j. 

Shape space is homeomorphic but not isometric to Euclidean three-space. Introduce spherical coordi- 
nates {R, (j), 6) on shape space, with 

i?2= J, 

and (p being the colatitude, taken so that cj) = is the equator. Then (Chenciner-Montgomery [2000], 
Montgomery [1998]) 

K = R^ + ^{4>'' + C0S2(<^)^2-) ^ I j|2/^2 ^ ||p||2/^. 

This decomposition of K sometimes goes under the name of Saari's decomposition. The first term 
represents dilational kinetic energy. The last two terms represent the kinetic energy of rotation and of 
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translation. J is the total angular momentum. P is the total linear momentum. M the total mass. The 
second term (i?^/4)(0^ + cos-^ {(pjO^) of K represents deformations of the similarity class of the triangle. Let 
us write 

SO that this second, "pure shape" part of K is (R'^ /4)Kshape- Kshape corresponds to twice the kinetic energy 
of a free particle on a unit sphere. That sphere is the shape sphere, the sphere whose points represent oriented 
similarity classes of triangles. 

The negative of the potential U can be expressed as 

u = u{<i), e)/R 

where U is a, function on the sphere. 

To obtain the three-body equations in the case of angular momentum zero, we set P and J to zero, and 
compute the resulting Euler-Lagrange equations. 

8. Proof of theorem 2 in the case of equal masses. 

We proceed with the proof of theorem 2 in the equal mass case. What makes this case special is that it 
is the only mass distribution for which the Lagrange points coincide with the North and South poles of the 
shape sphere. Then the height 

z = sin((j)) 

above the equator is the variable of theorem 2, where R, </>, are the spherical shape coordinates of the 
previous paragraph. The Lagrangian for the zero-angular momentum motion is 

Lc = {l/2)R? + ^(0' + cos\cj>)0^) + -ic/(<A, 9) 
The Euler-Lagrange equations for cj) are ^(|^) = or 

d i?2 I QIJ 

-2 1 du. 

= -z{-cosm -^-^^}. 

Andi = cos(,/.)0sothat ^(^f i) = cos{<t>)i{^4') + {^4>)^^^ = cos{^)i{^'i^)-^sin{^)^\ Combining 
this equation with the previous one and looking back at the expression for K shape yields: 



where 

q = Kshape - 4 



,2 ^ cos{4>) dU 



sin{(j)) d(j). 

We must show that g > 0, with g = if and only if we are at the Lagrange shape z = ±1, with the velocity 
{R, (j), 6) satisfying (j) = 9 = 0. Clearly 

^ shape ^ 

with equality if and only if ^ = ^ = 0. It remains to show that 

cos{(j)) dU ^ ^ 
sin{4>) d(j) ~ 

with equality if and only ii z = ±1. We postpone the proof of the last inequality since we will need it for 
any mass distribution, and our proof will be independent of mass distribution. See (INEQ2) and its proof 
below. 
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9. Conformal geometry of the shape sphere; height variables. 

The variable z of theorem 2 is a function on the shape sphere. The two key properties of this variable z 
which we used in the proof of theorem 1 are that its zero locus is the equator of collinear configurations, and 
that its critical points arc the Lagrange points. In the equal mass both properties arc satisfied by the height 
function above the equator, z = sin((/)), where ^ is the signed distance of a point on the sphere from the 
equator. The North and South poles (the points a maximal distance from the equator) of the shape sphere 
coincide with the Lagrange points if and only if all the masses are equal. Consequently, the height function 
above the equator fails to satisfy the second key property in the case of unequal masses, and we are forced 
to make another choice of the variable z. 

In the case of general masses, wc take z to be the height function as it would be defined if all the 
masses were equal. This variable satisfies the two key properties, but complicates the kinetic energy of the 
Lagrangian. We must understand this complication. The crux of the matter is that this choice of z is 
tantamount to applying a conformal transformation to the shape sphere which takes the Lagrange points to 
the North and South poles, while mapping the equator to itself. This conformal transformation arises via a 
canonical conformal transformation from the m-sphere to the equal mass distribution sphere. 

The shape space is defined to be the space of oriented congruence classes of triangles, while the shape 
sphere is the space of oriented similarity classes of triangles. In other words, shape space is the quotient 
of the three-body configuration space (IR?)^ by the group of orientation preserving isometrics, while the 
shape sphere is the quotient of (M^)'^ \ {triple collisions} by the group of orientation preserving similarity 
transformations. As topological spaces, neither space depends on the choice of masses. The shape space is 
homeomorphic to Euclidean three space, while the shape sphere is homeomorphic to a two-sphere. 

The triple collisions get mapped to a distinguished point of shape space, called the triple collision point, 
or origin. The action of dilation fixes this point, while changing all other points of the shape space. The 
shape sphere can be canonically viewed as the shape space minus this triple collision divided by the action 
of dilations. 

A choice m = {mi, 1712,1713) of masses defines a kinetic energy metric on the three-body configuration 
space. This in turn induces a metric on the shape space, since the shape space is the quotient of the 
configuration space by a group of isometrics. The shape sphere can be realized as the set of all points in 
shape space a distance 1 from triple collision, and from here the shape sphere inherits a metric as well. We 
denote this metric by d^Sm. The shape sphere with this metric is isometric to the standard round metric on 
a sphere of radius 1 /2 in Euclidean space. We then have that the metric on shape space is given by 



. This expression accounts for the kinetic energy of the previous section. 

The shape sphere has a conformal structure which is independent of the kinetic energy, i.e. is indepen- 
dent of the mass distribution. This conformal structure is implicit in the work of Albouy-Chenciner [1998]. 
We will need the explicit conformal factor A relating two kinetic energy metrics on the sphere. 

Proposition. TJie shape metrics (fsm and d'^Sm' for two different mass distributions m and m' are confor- 
mally related according to the formula 



We will take for coordinates on the shape sphere standard spherical coordinates (/), 6 for the equal mass 
distribution m' = (1, 1, 1) metric. Thus d^Sm' = dcfy^ + cos{(f>)'^ dO'^ . When wc write the metric for d'^Sm in 
these coordinates we get d^Sm = \{(t>,0){d(tP' + cos{(j))^ d9^) with A = c{m)I^/c{m')I^, as in the theorem, 
where c(m) is the total mass divided by the product of the masses. Recalling that the metric defined by the 
mass distribution m on the three-dimensional shape space is dR^ + {E? / A)d^ Sm where E? = Im, we see that 
the kinetic energy on shape space, which is obtained by setting the total linear and angular momentum to 
be zero (P = J = in the expression for K of the previous section) is 



dR^ + {l/2fR^d^s, 



1711+1712+ 1713 j2 ,2 



171117127113 




m'l + m'2 + TO3 

m[m'2m'^ 




K — + —K shape 
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with 

Kshape^{X{(^.e){4>'' +COs'{cj>e'') (2). 

The proposition implies that the shape sphere has a fixed conformal structure, independent of choice 
of masses. The group of orientation-preserving conformal automorphisms of the sphere is the same as the 
group of orientation-preserving, circle-preserving transformations. Thus it makes sense to speak of circles 
on the shape sphere without specifying any mass distribution. 

Lemma [on circles]. Write Si = r|j, where ijk is a permutation of 123 for the squared side lengths 
of a triangle. And write A = in • (x2 — Xi) x (xa — Xi) for its signed area. Then the linear equation 
Asi + Bs2 + Css + DA = with A, B, C, D real constants, describes a circle in the shape sphere, provided 
the set of triangles satisfying the inequality is nonempty. Conversely, every circle in the shape sphere is 

described by such an equation. 

The proofs of proposition and the lemma are postponed to after the proof of theorem 2. 
10. Proof of theorem 2, unequal mass case. 

The proof begins by computing the Euler-Lagrange equations in our special coordinates. The compu- 
tation is as for the equal mass case, the main difference being the occurence of A in the Lagrangian. We 
compute the Euler Lagrange equations for (p, and then for z = sin(0). We have Lagrangian L = {1/2)K + U 
where K is given by equation (2) above. The Euler-Lagrange equation for (j) is then 



Using this equation and z = sin{(f)), so that z = cos{(p)^ as in the equal mass computation, and expanding 



out 4-AR^Xz) yields 



where 



lcos(^19A 2 , cos(0) dU 

2 sm(0) \d(f) sm(0) ocp 



Now 

^shape ^ 

with equality if and only if all the kinetic energy is in the dilational (i?) motion. To conlcude the proofs 
then, we require that 

(1-^|^§>0 ilNEQl) 
sm{(p) 2 \o(p 

and 

-^§>0 iINEQ2) 
sm((pj 0(p 

for < < 7r/2, and for the mass distribution as given. 

Note that both U and A are even functions of cj) by reflectional symmetry. The derivative of any function 
/((/), . . .) which is an even function of tp must be zero at (j) = 0, and consequently is smooth through 

(j) = 0. It follows that both and ^ are smooth functions through the equator. 

Proof of Inequality 2. The inequality (INEQ2) is valid for all mass distributions. Since cos{(f)) / sin{<t)) 
is an odd function, positive fovQ < (j) < 7r/2, and since ^ is also odd, it suffices to show that — is positive 

in the range < < tt/2. 

The proof of the positivity of — ^ is elegant but tricky. Introduce as coordinates in shape space 
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for ijk any permutation of 123. Then 



TT / 1/2 1 / 1/2 , / 1/2 

U=m\m2/s^ +ni3mi/s2 +TO2TO3/S]^ 

while 

I = {mim2S3 + m3miS2 + m2m3Si) /M (LI). 

with M = TOi + m2 + TOs. To differentiate with respect to we fix 7 and 9, thus defining meridianal circles 

passing through the Lagrange point, and then differentiate along these meridianal curves. The crux of the 
inequality is to observe that each of these meridianal curves is defined by a linear constraint 

Asi + Bs2 + Cs3 = (L2) 

when written in terms of the Sfe. Here A,B,C are any real constants, not all zero, but summing to zero. 

To see the validity of this representation of the meridianal curves, use the lemma of the previous section. It 
says that any circle in the shape sphere can be expressed in the form Asi + Bs2 + Css + DA = 0. Now the 
meridianal circles pass through the two Lagrange points L+ and L_,and any circle passing through these 
two points is a meridianal circle. The Lagrange points are characterized by si = S2 = S3, while their signed 
areasare are negatives of each other: A(L+) = — A(i_). Writing Si = s and A = A(L-i-) we see that the 
the coefficients defining the circles satisfy {A + B + C)s + DA = and {A + B + C)s — DA = 0. Neither s 
nor A are zero. Subtracting the two equations yields D = 0. Adding them yields A + B + C = 0. 

Since l/s^^^ is convex for s > 0, C/ is a strictly convex function in the positive coordinate orthant 
Sfc > 0. The constraints (LI) and (L2) are linear, so upon restriction, U is again a strictly convex function. 
Consequently, with the constaints imposed, U has at most one global minimum. But (cither of) the Lagrange 
point L (i.e L_|_ or L_) is the global minimum of U when we impose only constraint (LI). (Note that A does 
not occur in the constraints or in the expression for U. In essence we are also allowing reflections when we 
ignore A and use only the Si as coordinates on the shape space.) All the lines defined by (L2) pass through 
L. Consequently, U restricted to the line (meridian) defined by both constraints (LI) and (L2) has a unique 
minimum at L and is strictly increasing as we move away from it. The variable (I) monotonically decreases 
as we move away from L toward the equator. This proves that 

dU 

for all (f) with < ^ < 7r/2. 

Proof of Inequality 1. 

We can rewrite the desired inequality as 

cos((/)) d , f 
sm(0) 0(p 

where 

/ := I/h 

and where I have used the fact A = CIi/P so that —^j-^^ — +^^0(7/. 
To compute this logarithmic derivative of 7, define variables 

Si := Si/h = r|fe/7i, 

so that ^ 

7 = — TO1TO2S3 + m.3TOlS2 + 1712171331, 

where M = mi + m2 + TO3. We need to be able to differentiate s, with respect to </>. This is easy once we 
have the representation: 

Si = l-cos{ct>hi{e) (3) 
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which we now explain, following Chenciner- Montgomery [2000], pp. 890-891, or the end of the appendix 
here. 

We can represent a point in shape space as a 3-vector w in Euclidean 3-space which we express in 
spherical coordinates as 

w = /i(cos((/>) cos(^), cos((/>) sin(0), sin(0)). 
Then Ji = ||w||, while A = 7i sin(^) = a signed area, and 

Sfe :=r?- = |w| - w-bfc, 

where ijk is a permutation of 123, and where the are three unit vectors on the equator ^ = which 

represent the binary collision rays. These three unit vectors are arranged at the vertices of an equilateral 
triangle circumscribed in the unit circle. Write u = (cos(0), sin(0), 0) and 

7fe(^) := u- bfe. 

Then we have that Sk = h — h cos(^)7fc {6) and the equation (3) for the Sj follows immediately. 
Writing 

Pk = minij/M > 

we have 

I = SpjSj. 

Using the expression (3) for Sj we compute: 

d 

g^logl = T,pk sin(0)7fc/SpfeSfe. 

It follows that 

■ '^^°9i = Sj3feCos((?!))7fe/Epfe(l - cos((/))7fe, 
sm((p) 0(p 

and 

1 + "^^ii^^ai = ^Pk/^Pk{i - cos(</>)7fe)- 

sm(0) 0(p 

Now use the fact that | co&{4>)^k\ < \lk\ < 1 and that at least one of the |7fc| is less than 1 to conclude that 
the previous expression is finite and positive. 
QED 

11. Proofs of the proposition and the lemma ;Conformal Geometry. 
We will give two different proofs of proposition, and one proof of the lemma. 

11.1. Proof of the proposition via Jacobi coordinates. 

Write E — IB? x IR^ x IB^ for the configuration space of the three-body problem. The ith Euclidean 
plane factor represents the positions of the ith body. Write points of ii^ as a; = {x\,X2,Xz) € E with 
Xi e IB?. Identify JB? with the complex numbers W in the standard way so that E = W^. The Jacobi map 
J7m associated to the mass distribution m = {mi, 1712,1713) is the linear map 

•Jm : S — > (T^ 

given by 

Jm{xi,X2,X3) = {z-i,Z2) 

where 

Zl = ^/JIl{x2 - Xi), 
Z2 = y/Jl2{x3 - {{miXl + m2X2) / (mi + m2)) 
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and 

1 _ 1 1 
/Lti mi m2 ' 

1 _ 1 1 
fj,2 mz mi+m2' 

Physically zi is the normalized edge vector joining 1 to 2, and Z2 is obtained by normalizing the vector which 
joins the center of mass of this edge to the remaining vertex. 

The Jacobi map is invariant under translations: Jm{{xi + v,X2 + v,xz + v) = Jm{xi,X2-iXz). It 
diagonalizes the kinetic energy 

K ■= TOillxif + m2||i;2f + msUxsf 
= pif + ||i2f 

provided the total linear momentum is zero: mixi +7712X2 +777^x3 = 0. Similarly, it diagonalizes the moment 
of inertia tensor: 

/ := mi||a;i||2 + m2||a;2|p + msUxslP 

= \\zir+\\z2r 

provided the center of mass is at the origin mia;i + m2a;2 + m^Xz = 0. 

The action of the group of orientation preserving similarities on triangles x becomes, under the Jacobi 
map, the action of complex scalar multipHcation: (21,22) i-^- (Azi, AZ2), A G (T, A 7^ 0. Thus the shape sphere 
is identified with the complex projective line (TIP^, the space whose points are complex lines in W"^. The 
quotient map 

sends a nonzero complex vector (21,22) to the complex line 77(21,22) = [21,22] which it spans. The map 

TioJra '■ -E\ {triple collisions} WIP^ = S"^ sends a triangle x € E to its "shape" meaning oriented similarity 
class. Note that we must delete the triple collisions Xx = X2 = x^ because they form the kernel of the Jacobi 
map. 

If we now repeat the procedure with a different mass distribution m' = {m'i,m'2, m'z) we obtain different 
Jacobi coordinates wi,W2, which diagonalize the new moment of inertia Im' ■ 

We abstract the situation described above. Consider a complex two-dimensional vector space (T^ with 
its standard complex structure. This vector space represents the space of Jacobi coordinates. Write (DIP^ 
for the corresponding complex projective line. It is the quotient of (U^ \ {0} by the action of complex scalar 
multiplication. 

A Hermitian inner product on (T^ induces a metric on (TIP^ as follows. Write I{z) = (2, 2) for square 

norm for this Hermitian inner product. Setting 7=1 defines a three-sphere Sj with induced Riemannian 
metric coming from the real part of the Hermitian innerproduct. . The subgroup C (U* preserves /, 
and the inner product, and hence acts on Sf by isometrics. Consequently the quotient Sf/S^ inherits a 
Riemannian metric by declaring the submersion 5*^ Sj /S^ to be a Riemannian submersion. The quotient 
space Sj /S^ is canonically identified with (UIP^ by sending the S'^-orbit of a point 2 € 5*1 to the corresponding 
(D* orbit. In this way, we obtain a Riemannian metric (Psj on WIP^. If (21, 22) are Hermitian orthonormal 
coordinates so that / = |2ip + |22p, and if 2 = 21/22 are the corresponding afBne coordinate on (DIP^, then 

dsi = \dz\/{l + \z\'^) for 7= |2i|^ + |22|^ 

Consider another Hermitian inner product, with corresponding square norm 7'. We then have another metric 

dsp = \dw\/{l + \wf) for 7' = [wi]^ + |«;2|^ 

on the same projective space, but now with afBne coordinate w = wx/w2- The proposition becomes a special 
case of 



10 



Theorem 3. Let I and I' be the square norms for two different Hermitian structures on the same complex 
two-dimensional vector space. Let (UIP^ be the projectivization of this vector space, and let dsj and dsj' be 
the two metrics on this projective space induced by our two Hermitian inner products. Let L -.V ^ V be a 
linear operator intertwining the two norms: I{Lz) = I'{z). Then the two metrics are related by 

dsr = \det{L)\{I/I')dsi. 

Proof of theorem 3. From basic linear algebra, the complex linear intertwining map L of the theorem 
always exists. It is found by choosing orthonormal coordinates (21,-22) for /, expressing the inner product 
for I' as a matrix in these coordinates, and then diagonalizing this matrix. If 



L = 

then 



a b 
c c 



Wl = azi + bz2 

W2 = czi + dz2 

are orthonormal coordinates for the Hermitian inner product with square norm /'. The corresponding affine 
coordinates z = zxjz^ and w = w\/w2 are then related by the linear fractional transformation 

w = {az + b)/{cz + d). 

We compute 

dw = {ad — bc)dz/ (cz + rf)^. 

(We ask our gentle reader to please bear with us and not be confused by the two meanings of the letter "d" 
here.) Setting D = \ad — bc\ = \det{L)\, we have 

\dw\ l + l^p D\dz\ 1 



1 + l + |u;|2|c0 + d|2l + |2|2 

_ l + D\dz\ 

" \cz + d\^ + \az + b\^ 1 + \z\^ 

|22p + kiP D\dz\ 
\cz\ + dz2\'^ + \azi + 1 + l^^l' 



/' 1 + |2|2 

In the third line we multiplied both the numerator and denominator of the first fraction by |-22p. QED 

Completion of the Proof of the proposition. Theorem 4 tells us that d^s„i' = C{l'^) / {l'^,)d'^ Sm 
and that the constant C is given by C = \det{L)\^ where L is an intertwining operator taking to To 
complete the proof of the proposition we solve for L so as to obtain the correct constant C. 

Fix the triangle x = (.xi, .X2, 2:3) € £" = IR^ x IR^ x IR"^, the configuration space of the three-body 
problem. Then it has two images z and w in (T^ according to the Jacobi maps for the two different mass 
distributions m, and m.' . Write z = Jm{x) and w = Jm'{x). 

We look for a linear map L : (F^ ^ such that w = Lz. Make the upper triangular anzatz L{zi, Z2) = 
{azi, f3zi+jZ2). Using the above expression for the Jacobi map, the ansatz leads to the two linear equations 
azi = Wl and f3z-i + 72:2 = W2, or 



ay^{x2 - xi) = V/xi(a;2 - xi), 

and 

Py/]Ii{x2 - xi) + 7-v/M2(a;3 - (mixi + m.2X2)/ (mi + m2)) = V/^C^s - {m\xi + rr^2Xi)l {m'l + mj)). 
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The first equation lias a = ^ f*-"" ^ solution. Expanding out the seeond equation in .xq. .'E2, 2^3 and 
equating coefficients yields a system of three homogeneous equations, in the two unknowns [i and 7. The X3 
equation has 7 = ^ as a solution. Using this 7, the x\ equation has /3 = —y/jiljjii{mi/{m,i + 77x2) — 
m'l/ {m'l + TO2)) foi' solution, while the X2 equation fi = ^ ^I'^j li.\(vn-2.j (m\ + m-i) — m'2/ {m'l + m'^)) has for 
solution. These two (3s are checked to be equal, and so we get our invertible linear operator 

We have det{L) = a^f = y^/I5747Mi7*2- Plugging in the formulae for the fi in terms of the masses leads to 
MiA*2 = 17111712171^/ {mi + m2 + ma) := c(m). Consequently det{L) = ^/c{m')/c{m). Finally, plugging in 
m, = (1,1,1) yields the formula of the proposition. 

11.2. Invariant theory. 

In order to obtain another proof of Theorem 3, we search for a metric-independent geometric interpre- 
tation of expression I^d^Sm- This alternative point of view will also yield a simple proof of the lemma on 
circles. 

Consider the vector space V of planar triangles modulo translation, i.e. [IR^)^ modulo translations. 
V in a, complex two-dimensional vector space, which is to say a real vector space endowed with an almost 
complex structure J, but with no canonical inner product. The inner product must await the introduction 
of masses. J rotates triangles by ninety degrees counterclockwise. The circle group acting on triangles 
by rotation consists of the transformations exp{9J), 9 real. 

Consider the real vector space V of real quadratic S'^-invariant polynomials on V which are invariant 
under the action of the circle group. V is also a four- dimensional real vector space. One choice of basis for V 
consists of the the squared side lengths Sk = r?- and the signed area A. Another choice of basis is obtained by 
choosing complex linear coordinates, for example Jacobi coordinates, ^1,2:2 for V. Then l^^ip, |-22p and the 
real and imaginary parts of Z1Z2 form a basis for V. If {z, w) = ZiWi + Z2'W2 denotes the standard Hermitian 
form relative to these coordinates, then we can identify V with the space H of two-by-two Hermitian matrices. 
For any invariant / can be expressed uniquely in the form 

I{z) = {z,Hz) 

for some unique Hermitian matrix H Q Ti. 

Every S'^^-invariant function / is expressible as a function in the quadratic invariants. It follows that if 
we know the values of a point v £ V on a, basis for P, then we know the S'^-orbit of v. Let P* be the vector 
space dual to P. For v gV, define a linear functional ev{v), the evaluation map, on P by: 

ev{v){Q)=Q{v). 

This evaluation map is a canonical map 

ev.V ^ P*. 

and according to what we have just said, its image is a realization of the quotient space V/S^, i.e. of "shape 
space" . 

Lemma. The image ev{V) of the evaluation map is isomorphic to the quotient space V/S^. This image is 

the positive half of a quadratic cone in the vector space P* , the cone being defined by the vanishing of a real 
quadratic form of signature (3,1). Consequently, P* and P are endowed with canonical Minkowski inner 
products, denonted (3{v,w), unique up to scale. 

A choice of basis for P is a system of linear coordinates on P* . The cone of the lemma can be described 

as a quadratic relation between the elements of the basis. If we choose for basis the squared side-lengths 
Sk = r^j , together with the signed area A of the triangle, then the cone results from Heron's relation 

16A^ = (ri2 -I- r23 -|- r3i)(ri2 + r23 - r-3i)(r23 + rsi - ri2)(r3i + ri2 - r23). 
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Expand the right hand side to obtain 

ISA^ = 2siS2 + 2s3Si + 2S2S3 - (si + sl + si) ; Si>0 

which describes the positive half of the cone of the lemma. If instead we use the basis | zi p , | Z2 P , -Re (zi Z2 ) , Im{zi 
then the cone results from the relation ( | zi P | Z2 P = I ^2 P • Alternatively, take the basis Wq = | (| zi p + 1 Z2 p ) , 
Wi = ^(l^ip — |z2p)- W2 = Re{ziZ2); W3 = I'rn{ziZ2)- Then the positive cone is given by Wq = wf + w| +w'^, 
Wo >0, a relation which holds among the functions at all points of V. (This relation is familiar from the 
Hopf map.) If we use the coordinates zi, Z2 to view V as H, then we can also identify V* with H using the 
trace pairing to identify H with H*. In these coordinates: 

ev{zi,Z2)ij = Hij{z) := z^zj. 

and the cone is defined by the relation 

det{H) = ; tr{H) > 0. 

The group GL{V; J) of linear transformations of V which commute with J acts linearly on the invariants 
by pull-back, and hence acts linearly on V*. By construction, this action preserves the quadratic coneand 
so is an action by means of the linear conformal Lorentz group CSO+{/3) = CSO{3, 1)+. Here the subscript 
+ denotes the time orientation preserving part of the full Minkowski isometry group, and the S denotes the 
orientation preserving part. If we fix a complex volume element in V , and hence restrict GL[V; J) to SL(V), 
the action just defined is the well-known 2 : 1 homomorphism SL{2, (D) SO{3, 1). 

Now let us projcctivize, which is to say, divide by dilations. These dilations correspond to scaling 
similarities of our triangle. Now the set of rays in the light cone in Minkowski space forms a two-sphere. 
This is our shape sphere. The action of GL(y, J), which factors through CSO+{f3) as we have just seen, is 
an action on this sphere by conformal transformations. Now we are ready to prove the theorem 3. 

Second Proof of Theorem 3. 

Fix a representative Minkowski structure fi on V* , one whose cone C = {p : [3{p,p) = 0} is our 
quadratic cone. The restriction j3c to the cone is a degenerate metric of signature (2,0). If {x,y,z,t) are 
standard Minkowski orthonormal coordinates for {P* , (3) then (3 = dx^ + dy^ + dz^ — dt^ while C is defined 
by + + z'^ — P = 0. Write = + + z^. Write rf^cr^ for the restriction of (3 to the two-sphere r = 1 
in the space-like hyperplane t = 0. We compute 

/3c = r^d^at = t^d^cjt. 

More generally, if r is any time-like linear coordinate then 

7-2 

Pc = 7 rd <Jr- 

where the numerical constant (r, r) is the Minkowski length of the dual vector t £ P. To see this, write 
t = CT where (r, r) = 1/c, thus defining a unit time-like linear coordinate which can be completed to form a 
system {x, y, z, t) of Minkowski orthonormal coordinates. In this formula, rf^cr^ is again the restriction of /3 
to the unit sphere in the space- like Euclidean hyperplane r = 0. 

The square norm I for any Hermitian inner-product on ^ is a linear time- like coordinate on P*. Thus 
if /, /' are two such square norms we have: 

We are almost done. It remains to evaluate the constant {I',I')/{I,T). If H is the Hermitian matrix 
representing I in some system of coordinates, then H' = LHL* represents /' where L is the intertwining 
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operator. But wc have seen that a choice for the Minkowski inner product is {I, I) = det{H), and det{H') = 

\det{L)\^det{H), so that {r,r)/{I,I) = \det{L)\^ . QED 

Remark. A choice of square norm / fixes a normaUzation of the Minkowski inner product (3 by declaring 
that (/, /) = 1. With this normahzation, the shape space metric on the cone is jfic + dP. 

10.3. Proof of the lemma on circles. Circles on a sphere are obtained by intersecting the sphere 

with planes. Think of the sphere as the projectivizcd cone in Minkowski space. Realize this sphere as in the 
second proof of theorem 3 by intersecting the quadratic cone in 7-"* with the three-dimensional afSne space 
{/ = 1}, where I is the square norm for a J-compatible inner product on V. The Sj and A form linear 
coordinates on T'*, and so by restriction any three of them form linear coordinates on the afEne space 7=1. 
The planes in this affine space are defined by a linear equation in the Sj and A. 
QED 
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